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Abstract: In collider physics, jet algorithms are a ubiquitous tool for clustering particles 
into discrete jet objects. Event shapes offer an alternative way to characterize jets, and 
one can define a jet multiplicity event shape, which can take on fractional values, using the 
framework of “jets without jets”. In this paper, we perform the first analytic studies of 
fractional jet multiplicity Aj e t in the context of e + e _ collisions. We use fixed-order QCD to 
understand the Aj et cross section at order a 2 s , and we introduce a candidate factorization 
theorem to capture certain higher-order effects. The resulting distributions have a hybrid jet 
algorithm/event shape behavior which agrees with parton shower Monte Carlo generators. 
The Ajet observable does not satisfy ordinary soft-collinear factorization, and the Aj e t cross 
section exhibits a number of unique features, including the absence of collinear logarithms 
and the presence of soft logarithms that are purely non-global. Additionally, we find novel 
divergences connected to the energy sharing between emissions, which are reminiscent of 
rapidity divergences encountered in other applications. Given these interesting properties of 
fractional jet multiplicity, we advocate for future measurements and calculations of Aj e t at 
hadron colliders like the LHC. 
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1 Introduction 


For almost forty years, we have known that high energy particle collisions can produce jets 
[1, 2], The term “jets” has two related but different meanings: “jet formation” is the physical 
process by which quarks and gluons shower and fragment into collimated sprays of hadrons 
[3], while “jet algorithms” are an analysis technique used to cluster those hadrons into proxies 
for the underlying quarks and gluons [4]. Jet algorithms are a powerful way to categorize 
and organize collision events [5, 6], but event shapes (and jet shapes) offer a more sensitive 
probe of jet formation itself [7]. Indeed, though the observation of three-jet structure in 
e + e _ collisions strongly hinted at the existence of gluons [8, 9], an unambiguous discovery at 
PETRA [10-13] required the use of event shapes like thrust [14] and oblateness [11]. 

Recently, the distinction between jet algorithms and event shapes was blurred through 
the “jets without jets” framework, in which standard jet-based observables are mapped into 
jet-like event shapes [15]. These observables incorporate a transverse momentum threshold 
PTc ut and a jet radius R just like traditional jet algorithms, but they behave more like event 
shapes because they involve an inclusive sum over particles in an event and do not uniquely 
assign hadrons to jet objects. A particularly interesting jets-without-jets observable is jet 
multiplicity, 

Ajet(PTcut, R) = PTl ® (PTi,R ~ PTcut), (1-1) 

PTi ’ R 

where pti,r is the transverse momentum contained in a cone of radius R around particle 
i. By design, this observable is highly correlated with the standard jet count (for the same 
pTcut and R values). Crucially, Aj et can yield fractional values, offering a new probe of jet 
formation. 

In this paper, we initiate the analytic study of fractional jet multiplicity. For simplicity, 
we treat the case of e + e“ —> jets, though we briefly mention how to adapt our calculational 
techniques to collisions at the Large Hadron Collider (LHC). With two or three partons in 
the final state (e + e _ —> qq or qqg), Nj et always yields an integer value. Fractional jets only 
start appearing with four or more partons (e.g. e + e _ — > qqgg ), so our analytic calculations 
will start at This is a general feature of fractional jets: non-integer values only appear 

with three or more particles in a given jet region. 1 Our main technical results will be obtained 
using fixed-order perturbative QCD, though we will also discuss connections to factorization 
properties in soft-collinear effective theory (SCET) [16-20]. 

We will focus on the behavior of Aj et in the vicinity of dijet configurations, 

A2+ = Ajet — 2, A2- = 2 — Ajet, (1.2) 

though we do present some results for A 3 _ = 3 — Nj e t as well. This near-integer behavior 
exhibits a number of curious analytic features, which are already visible at O(otg'). 

1 For e + e~ collisions in the rest frame, momentum conservation prevents all particles from being in the 
same jet region. Hence four partons are required for non-integer values. 
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• Hybrid jet algorithm/event shape behavior. Jet algorithms have a finite cross section at 
(integer) jet multiplicities whereas event shapes typically have form factors that suppress 
the cross section at singular values. The A r j et distribution has both types of behavior. 
Even though the Born-level process e + e~ — > qq gives the integer value iVj e t = 2, one 
might naively expect the corresponding spike at A 2 ± = 0 to be completely smeared out 
by multiple emissions that generate finite values of A 2 ±. Instead, the spike at Ao± = 0 
is robust, as there is a finite region of the many-body phase space that still gives rise to 
integer values of A r j et . At the same time, the appearance of logarithms in A 2 ± at every 
perturbative order generates a shoulder at finite values of A 2 ±, which is suppressed 
as A 2 ± —> 0. Thus, the cross section at exact-integer values of IVj et behave like a jet 
algorithm while near-integer values behave like an event shape (see Sec. 2.1). 

• Cancellation of single- and double-soft divergences. The first non-trivial contributions 
to non-integer behavior of A(j eti arise from configurations where three partons are within 
a mutual radius of 2 R but not within R (see Sec. 2.2). For A 2 +, this three-parton phase 
space has singularities when one or two of the partons goes soft. These divergences 
arise because the observable receives parametrically equivalent contributions from the 
single- and double-soft regions, which are not regulated in dimensional regularization. 
Interestingly, these divergences are structurally similar to rapidity divergences, and we 
will introduce the analog of rapidity regularization [21, 22] to see that the single-soft 
and double-soft divergences do indeed cancel (see Sec. 3). We note that soft emissions 
contribute in a non-linear way, and therefore A(j e t is a non-additive observable. 

• No collinear logarithms. Typical event shapes have singularities in both the soft and 
collinear limit, giving rise to both soft and collinear logarithms. The resulting double- 
logarithmic structure appears as Sudakov form factors in the cross section. By contrast, 
collinear emissions do not generate logarithms of A/] et , and only soft logarithms appear 
in the IVj et distribution (see Sec. 2. 2). * 2 Thus, the suppression in the A 2 ± —> 0 limit is 
only single-logarithmic. 

• Non-global yet local structure. The A 2 ± cross section does not satisfy ordinary soft- 
collinear factorization, because the coefficients of the soft logarithms depend on the 
collinear structure of the jets (despite the absence of collinear logarithms). Furthermore, 
the soft logarithms in Nj et are purely non-global [26, 27], in the sense that they arise 
from soft emissions in a restricted region of phase space (see Sec. 2. 3). 3 These features 
would seem to preclude any standard factorization theorem, especially given the non¬ 
additive nature of Ajet,- In order to change the value of Aj e t, however, soft emissions 
must lie within < 2 R from the hard core of the jet. Thus, color coherence ensures that 

2 Mass-dropped observables [23-25] have the reverse behavior of only having collinear logarithms. 

2 As explained in Sec. 2.3, our use of the term “non-global” can refer to both non-Abelian and Abelian 

correlations. 


- 3 - 



the dependence on A 2 ±, albeit non-global, is local to each jet region. We will present a 
candidate factorization theorem that exploits this local structure (see Sec. 5). 

In addition to analytic studies, we will test iVj et using high-statistics Monte Carlo samples 
from Pythia 8 [28] and Herwig-I— b [29]. Within theoretical uncertainties, the Monte Carlo 
results confirm our analytic understanding. 

Given its many features and potential applications, IVj e t would be very interesting to 
measure at the LHC. As mentioned above, IVjet is a purely non-global observable, with the 
near-integer behavior determined only by soft and not by collinear divergences. To our knowl¬ 
edge, it is the only jet or event shape observable with this behavior. As such, it is a unique 
probe of soft physics, and measurements of IVjet can be used to test color coherence, under¬ 
lying event models, and pileup mitigation strategies. Furthermore, IVjet is useful basis to 
compare parton shower predictions for jet substructure, and we present an initial compari¬ 
son in this work. For new physics searches involving high-multiplicity final states, fractional 
IVj e t values interpolate between different jet multiplicities, obviating the need for exclusive 
jet bins. This interpolation also makes for an interesting version of the classic “staircase” 
plots for vector boson plus N jet production [30-38]. Finally, for the growing field of matrix 
element/parton shower matching/merging [39-50], IVjet has a continuous distribution unlike 
standard jet algorithms and a huge dynamic range compared to standard event shapes, so 
IVj e t can be used to test whether matching/merging procedures achieve a smooth interpola¬ 
tion, even in the soft regime. We note, of course, that theoretical calculations of IVj et for 
hadronic collisions must contend with additional effects such as the underlying event and 
pileup contamination. Moreover, it will be non-trivial to include non-perturbative effects, 
power-suppressed terms, and higher-order perturbative effects such as the resummation of 
non-global logarithms [26, 27, 51-54]. Although a detailed study of IVjet f° r the LHC is be¬ 
yond the scope of this paper, we briefly discuss some of these issues and potential solutions 
in Sec. 6. 

The rest of this paper is organized as follows. In Sec. 2, we review the basic physics 
behind IVj et and explain the kinematic regimes that give rise to fractional jets. In Sec. 3, we 
discuss the structure of rapidity-like divergences and how they appear in the IVjet calculation. 
In Sec. 4, we perform fixed-order calculations of A 2 ± at 0(o^), using both the full e + e~ —>• 4 
parton matrix element as well as a 1 —> 3 splitting function approximation. We then present 
a candidate factorization theorem for A 2 ± in Sec. 5, which includes a renormalization-group- 
independent “collinear function”. In Sec. 6, we briefly discuss how to extend our results to 
the LHC. We compare our analytic calculations to Pythia 8 and Herwig-I— b in Sec. 7, and 
we conclude in Sec. 8. The appendices contain further calculational results and details. 

2 Aspects of Fractional Jets 

Since we will be looking at electron-positron collisions, it is more natural to work with a 
variant of IVjet based on energies and angles (instead of transverse momenta and azimuth- 


- 4 - 


Monte Carlo Distributions 



Monte Carlo Distributions 



tfjet 


(b) 


Figure 1. The 7Vj et distribution in e + e _ collisions from Pythia 8 and HerwigH —b, over a wide 
range of multiplicities (left plot) and for 2 < JVj e t < 3 (right plot). The distribution peaks at integer 
values, and near-integer values are enhanced relative to intermediate values between integers. It is 
this enhancement that we will understand quantitatively through our calculations. 


rapidity distances): 

Njet(Ecut ,R)= ~ET~ - Ecut), ( 2 . 1 ) 

— R 

i G event 

where E% is the energy of particle i, 

Ei,R= EjGiR-dij), (2.2) 

j’G event 

and 0{j is the opening angle between particles i and j. 

For particles whose angular separation is always larger than R, Ei/Ei y R reduces to 1 , and 
iVj e t simply counts the number of hard particles above the energy threshold E cut . Because 
iVj e t is an infrared/collinear (IRC) safe observable, Vj e t always takes on integer values in the 
extreme soft and collinear limit. We will use the notation 

A n± = ±(7Vjet - n) (2.3) 

to indicate the near-integer behavior, with A n + (A n _) parameterizing the IVjet behavior just 
above (below) n. Our calculations will focus on A 2 ± and A 3 _ in e + e _ collisions. 

2.1 Hybrid Jet Algorithm/Event Shape Behavior 

The IVjet distribution is peaked at integer values, with substantial support in the near-integer 
regime. These different parts of the IVj e t distribution can be qualitatively studied with Monte 
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Figure 2. The same iVj e t distributions as Fig. 1, now plotted in “triptych” form to show the near¬ 
integer behavior in A 2 _, A 2 +, and A 3 _. Note that the A 2 _ and A 3 _ axes run backwards. The 
A 2 _ and A 2 + distributions interface at IVj et = 2 (where A 2 ± = 0), indicated by the left-hand dashed 
vertical line. Since we do not plot A 2 ± all the way to 0, we put a gap over the region around jVj et = 2. 
The A 2 + and A 3 _ distributions are connected at Nj et = 2.5 (where A 2 + = A 3 _ = 0.5), indicated by 
the right-hand dashed vertical line. 


Carlo generators. 4 We generate events for e + e~ —> hadrons at a center-of-mass energy of 
Q = 500 GeV in Pythia 8.183 [28] and Herwig-| —b 2.7.1 [29], including showering and 
hadronization. In Fig. la, we plot the distribution of IVjet across a wide range of values, 
showing the circus-tent-like peak and fall-off behavior of the cross section. 5 In Fig. lb, 
we focus on the distribution in the range 2 < IVj e t < 3, which is the region we aim to 
quantitatively describe in this paper. In Fig. 2, we plot the IVj et distribution in a “triptych” 
form that shows in more detail the near-integer behavior in A 2 -, A 2 +, and A 3 _, in particular 
the logarithmic enhancement as A n ± —> 0. 

As mentioned in the introduction, the cross section at exact integer values has a different 
behavior than at near-integer values, a feature related to the hybrid jet algorithm/event shape 
nature of the observable. Like a jet algorithm, the cross section at integer values IVjet = n 
has a non-zero value. Like an event shape, the non-integer cross section is suppressed by 

4 The original Njet variable in Eq. (1.1) is available through the Jets Without Jets package, an add-on to 
FastJet 3 [55] contained in the FastJet CONTRIB library (http://fastjet.hepforge.org/contrib/). The 
variant in Eq. (2.1) is available from the authors upon request. 

5 The peak at 7V) e t = 1 in Pythia 8 seems to arise from events with r leptons produced in hadron decays, 
where the resulting neutrinos carry away a substantial fraction of the jet momentum. The same feature is not 
visible in HerwigH —f, nor is it visible when hadronization effects are turned off in Pythia 8. 
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Figure 3. The same A/j et distributions as Fig. 1, now plotted in the very near vicinity of 7Vj e t = 2. 
Note the scale on the x-axis and the fact that the y -axis is logarithmic on the left plot but linear on 
the right one. At A/j et = 2, there is a spike in the distribution from the jet algorithm behavior of the 
fractional jet multiplicity. At non-integer values, the continuous distribution is more similar to event 
shapes. We show the near-integer values in two different ranges, and for the closer range (right plot) 
one can see bumps in the Monte Carlo distributions from multiple emission and hadronization effects. 


all-orders emissions as A r j et —y n. The behavior can be seen in Fig. 3, where we plot the 
distribution in the very near vicinity of A(j e t = 2. 

The reason why integer values Nj e t = n have a finite cross section is that they receive 
contributions from regions of phase space with non-zero measure. This can be seen easily at 
0(a s ) for e + e _ —> qqg, where the entire cross section lies at Nj et = 2 or 3. More generally, 
any collection of particles within a mutual radius of R will give an integer contribution to 
A/j e t. 6 This means that the differential distribution at integer values has the form 

-4^(iV je t = n) oc 8{n) . (2.4) 

d A/jet 

In contrast, the distribution for near-integer values A n y > 0 is dominated by logarithms 
of A n ±. We will later show that the logarithms of A n ± arise from soft emissions, and the 
most important terms scale single logarithmically as (a s In A n ±) k . These logarithms combine 
at all orders to suppress the cross section as A n ± — > 0, leading to the disjoint behavior at 
integer A/j e t. Note that single-logarithmic suppression is not as strong as the more familiar 
double-logarithmic suppression, so there are no Sudakov peaks in the A n ± distributions. 

6 Regions within mutual radius R are bounded by curves of constant width R. A circle of radius R/2 is the 
simplest example, but there are more general examples like the Reuleaux triangle or the Canadian loonie. 
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Figure 4. Schematic representation of the different phase space configurations with three merged 
partons. The numbered line shows the corresponding range of 7Vj e t■ Here, we are considering just a 
single jet region; an e + e _ —>• dijet event has two such jet regions. Circles represent cones of radius R, 
large blue dots represent energetic partons, and small red dots represent soft partons with z < z cut . The 
near-integer regions Ai_, Ai + , and A 2 - get contributions from single- and/or double-soft emissions. 


2.2 Soft and Collinear Limits of Fractional Jets 

To understand the leading near-integer behavior of jVjet, we can focus on the soft and collinear 
limits. As discussed further in Sec. 5, the all-orders structure of the cross section does not 
satisfy standard soft-collinear factorization, but a soft-collinear analysis is still fruitful at 
fixed order. The essential physics appears already in the three-parton phase space, where the 
near-integer behavior is dominated by soft physics. We will focus on a single jet region here; 
to describe e + e~ —> dijets, we sum over the contributions to iVj et from both jet regions (see 
Eq. (5.2)). 

We call a group of particles merged if their contribution to N- ]eX is connected, such that 
removing a subset of particles affects the contribution to Aj et from other particles. For a single 
isolated particle, its contribution to A/j et is 1. For a merged pair of particles with separation 
less than R, one still obtains an integer value of A r j e t : 


JVjet= 


Ex 


+ 


E2 


E\ + E 2 Ei + E 2 


= 1. 


(2.5) 


Because a single soft/collinear emission does not contribute to the value of A r j e t, it is not 
linear in soft and collinear momenta in the singular region of phase space, and hence it is a 
non-additive observable. 

Now consider three merged partons. As shown in Fig. 4, there are various different phase 
space configurations that lead to different values of Nj et . To achieve non-integer values, one 
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needs a phase space configuration with 


#12 > R, 013 ? 023 < R, 


( 2 . 6 ) 


or permutations of the parton labels. If all Ej > E cut , then the contribution to the jet 
multiplicity is 


N- = El + E2 + E ' 3 

J E± + E 3 E 2 + E 3 E\ + E'2 + E 3 


e (1,2). 


(2.7) 


In terms of energy fractions 


Ej 

E\ + E 2 + E 3 


Z1 + Z2 + Z3 = 1, 


_ E cut 
Zcut = Ei + E 2 + E 3 


( 2 . 8 ) 


which removes the overall energy scale of the jet as a degree of freedom, the observable takes 
the value 


*jet = 


Zl , 

-T- 1 -T-^ z 3 

Zl + ^3 Z 2 + z 3 


(2.9) 


For this merged triplet, near-integer behavior is obtained when one or two of the partons 
goes soft, as shown in Fig. 4. In these soft limits, the observable may depend on a single soft 
momentum or a product of soft momenta. For example, if particle 1 is soft, 


IVjet -> 1 + Zl Z2) + O(zf) . (2.10) 

J z 1 < 1 1 - Z 2 

Or, if particles 1 and 2 are both soft, 

IVjet -^ 1 + 2ziZ2 + O (^i 2 ) • (2-11) 

Zl, 2 <l 


The first case is typical of event shape observables, as the near-integer behavior is linear 
in the soft particle’s energy. The second case, however, is unusual—it goes as the product 
of soft momenta, another demonstration of the non-additive nature of IVjet- This feature 
creates a novel structure in the perturbative series, reminiscent of rapidity divergences in 
SCETn [ 21 , 56]. We discuss this further in Sec. 3, using a toy observable A = Z 1 Z 2 that 
exhibits the same analytic features. 

With more emissions, near-integer behavior requires the soft limit of one or more parti¬ 
cles.' As mentioned above, an arbitrary collection of energetic particles will yield an integer 
IVj e t if the particles can be grouped into merged clusters where each particle in the clus¬ 
ter is within R of all other particles in the cluster (such that each cluster has IVjet = !)• 
Non-integer values are obtained when this is not satisfied (as in the merged triplet example 
above), though generically the resulting values are far from integers. Near-integer values are 
obtained by starting from an integer IVjet configuration and then adding any number of soft 

'We are neglecting special configurations of energetic particles where the value of iVjet happens to be 
near-integer, since those regions of phase space are power suppressed. 
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particles which are not within the mutual radius R of the cluster. The contribution of these 
soft particles to iVjet will be suppressed by 

£soft 
Esoit ,R 

and the deviation of IVjet from integer values is similarly suppressed by this ratio as long as 
E soit . R > E cu t. In this way, the near-integer behavior is determined by soft emissions in the 
vicinity of hard clusters, and soft emissions will generate logarithms of A n ±. 

By contrast, collinear splittings cannot generate near-integer behavior of IVjet and hence 
do not generate logarithms of A„-i-. For collinear splittings of angle R c , the only effect on IVj e t 
comes from particles who are within R of either of the two daughters or the parent, but not 
within R of all three. For small R c , this is a power-suppressed region of phase space and not 
logarithmically enhanced. Said another way, the effect on iVjet from a small-angle splitting 
is not smooth, as it either preserves the value of iVjet or discretely changes it by including 
or excluding particles from the various Ei n terms; this behavior cannot generate logarithms 
of A re ±. So unlike standard jet shapes (like jet mass) which depend on both the softness 
and collinearity of a splitting, the near-integer iVj et value depends only on the softness of 
emissions. 

2.3 Non-Global yet Local Structure 

We have argued that soft emissions contribute to near-integer values of iVj e t, but this is only 
the case if the soft emissions lie within a restricted phase space of size < 2 R around the jet. 
Wide-angle soft radiation does not contribute to iVjet, since those emissions yield Em < E cu t. 8 
This angular restriction on soft radiation produces non-global logarithms of A n ±, which are 
logarithms that arise from emissions in a restricted angular region of phase space [26, 27]. 

At leading order, non-global logarithms are often associated with correlated soft emissions 
from non-Abelian matrix elements and are therefore proportional to CfCa- For the case of 
fractional jet multiplicity, however, the measurement itself introduces a correlation between 
different emissions, and this effect appears for both non-Abelian and Abelian matrix elements. 
As we discuss in detail in Sec. 4.1, the allowed phase space for a soft emission to change Aj et 
depends on the phase space of other soft emissions and on the phase space of the hard partons, 
similarly to what happens for clustering logarithms [57-60]. In this regard, all soft logarithms 
of iVjet, including those proportional to Cp, can be considered non-global. 

Moreover, the fact that the allowed phase space for soft emissions to change IVjet depends 
on the phase space of the hard partons means that one cannot consider how IVjet depends 
on soft emissions without also considering collinear emissions, and vice versa. Because the 
contributions to IVjet from soft and collinear emissions are inextricably linked, we will show 
in Sec. 5.2 that IVjet does not obey standard soft-collinear factorization [19, 61-63]. As 

8 As discussed in Sec. 5.4, wide-angle soft radiation does affect the cross section at a fixed value of IVj e t, just 
not the iVj e t value itself. 
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mentioned above, collinear emissions by themselves do not generate logarithms of Aj et , but 
they do alter the allowed phase space for soft radiation within a jet region. Thus, collinear 
emissions will modify the coefficients of soft logarithms of iVj e t, which is a sign of collinear-soft 
non-factorization. 

While the A n ± dependence is non-global, it is also local to each jet. Emissions affect¬ 
ing iVjet are restricted to an angular region of < 2 R around each jet direction. Additional 
emissions away from all jets can also create their own jets, changing ATj e t by an integer or 
near-integer amount, but an emission far from all jets cannot change iVj et by a small amount. 
Because of color coherence, this implies that the contribution to Nj e t from a given jet is, to 
leading power, independent of all other jets in the event. 9 The Nj et distribution is therefore 
a sum of contributions from jet regions that are to a good approximation independently de¬ 
termined. We will formalize this “local” factorization structure in Sec. 5.4, and see how it 
could simplify LHC calculations in Sec. 6. 

3 Rapidity-like Divergences 

As mentioned in Sec. 2.2, the calculation of Nj et at O(a^) features divergences not regulated 
by dimensional regularization (dim reg) when the calculation is divided into unique soft 
limits. These divergences have a strong resemblance to rapidity divergences in SCETn, where 
the large rapidities of soft and collinear modes in the effective theory generate divergent 
integrals not regulated by dim reg [21, 22, 56]. The similar divergences appearing in the A] e t 
calculation are not from physical rapidities, but instead originate from the energy sharing 
between particles and are easily cast in terms of a “rapidity” of this energy sharing: 

y = \ ln w- t 3 - 1 ) 

Z Z2 

In this section, we show how these divergences arise in the Aj et calculation and how we can 
adapt standard rapidity regulators to our case. 

Instead of the complete A r j e t calculation, consider the simplified observable 

A = ziz 2 , (3.2) 

where particles 1 and 2 are particles that may be soft. A has the same soft scaling properties 
as the near-integer behavior of A r j e t (see Eqs. (2.10) and (2.11)), and the other complications 
from the complete Aj e t calculation are irrelevant for the discussion here. Like A 2+ to be 
calculated in Sec. 4.1, A receives contributions from both single-soft and double-soft regions 
of phase space. Also like A 2+ , A is a non-additive observable that does not get a linear 
contribution from each soft emission. 

9 Color coherence states that emissions collinear to a given jet and well-separated from all other jets are 
only sensitive to the kinematics and color of that jet and the anti-color of the jet, i.e., the color of all other 
jets in the event [64]. 
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1 soft, 2 collinear 


Z2 



Figure 5. The single- and double-soft modes depicted in the Z 1 -Z 2 plane for the simplified observable 
A = Z\Z 2 (similar to A 2 +). We assign the power counting A ~ A 2 . In the case with only one soft 
particle, the soft energy scales as A 2 , whereas in the case with two soft particles, the soft energies 
scale as A. These modes are connected by the hyperbola shown, which can be thought of as tracing 
out the rapidity in the energies of the two partons (see Eq. (3.1)). In our calculation, we encounter 
divergences in this rapidity variable for each mode depicted. This picture has a strong analog in the 
divergences in physical rapidity of single particles in SCETn. 


The interplay between the single- and double-soft limits is illustrated in Fig. 5. In order 
to have a consistent power counting for A, we need to consider two different scalings of the 
soft modes. Let A < 1 be the power counting parameter, with A ~ A 2 . The phase space 
regions with the same parametric contribution to A are: 

1 soft, 2 collinear (sc): z\ ~ A 2 , Z 2 ~ 1, 

1 soft, 2 soft (ss): z\ ~ A , Z 2 ~ A, 

1 collinear, 2 soft (cs): z\ ~ 1, Z 2 ~ A 2 . (3-3) 

Thus, there are two types of soft modes — single-soft modes scale as A 2 , whereas double- 
soft modes scale as A — and there is a unique soft power counting within each sector. This 
relative scaling is the same as ultrasoft modes in SCETi (which scale as A 2 ) and soft modes 
in SCETn (which scale as A). This indicates that the calculation has contributions to the 
observable from soft modes in both SCETi and SCETn. The “energy rapidity” variable y 
in Eq. (3.1) separates these phase space regions by measuring the relative energy sharing 
between particles 1 and 2. Like an ordinary rapidity, the range is y E (— 00 , 00 ). The sc 

region has z± < Z 2 (*/ ~> — 00 ), the ss region has z\ — Z 2 (|y| ~ 1), and the cs region has 

z 1 Z 2 (y —> 00 ). Each of the sc/cs/ss sectors gives an independent contribution to the 

A distribution at 0(a 2 ) which should be properly summed (i.e. one has to remove potential 
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double-counting). As we will now see, unbounded integrals in y produce divergences, not 
regulated by dim reg. 

In full QCD with dim reg in cL = 4 — 2e dimensions, the soft divergences of A are 
encapsulated by the integral 

/ full (A)= [ 1 ^^( ZlZ2 )-*6(A-z 1 z 2 ) 

Jo Z 1 z 2 

= A 12e J d ? /0 (-^ ln ( 1 / A ) < y < ^ ln ( 1 / A )) 

= ^S(A) - A (A) 0(A < 1) + 0(e) , (3.4) 

where C\ is a logarithmic plus-function defined in Eq. (B.3). If we take soft limits of the 
phase space, where the boundaries scale as [0,1] —> [0,oo], we get: 

1 soft: I sc ( A) = [ — [ — (z 1 z 2 )~ 2e 6(A - ziz 2 ) 

Jo z i Jo z 2 

= A’ 1 ’ 26 / 00 dy ©(—^ ln(l/A) < y) , 

2 soft: I cs ( A) = [ — f — (zi2 2 ) _2e 5(A - ziz 2 ) 

Jo z i Jo z 2 

= A- 1 - 2e y°°dy0(y<^ln(l/A)), 

1, 2 soft: I ss (A) = [ — [ — (ziz 2 )~ 2e 6(A - ziz 2 ) 

Jo z i Jo z 2 

/ OO 

d y. (3.5) 

-OO 

Each integral is divergent and unregulated, but the A < 1 regime of full QCD is reproduced 
by the combination 10 


ifuii(A) = [isc(A) + I cs (A) - I ss ( A)]0(A < 1). (3.6) 

This result is as expected, since the double-soft limit should remove the double-counting of 
the single-soft limits, even if each contribution is not well-defined individually. Note that the 
unregulated divergences appear only in y. 

This structure of divergences in y is identical to those in physical rapidities of single 
particles in SCETn. These divergences may be handled in a number of ways, and we will use 
the rapidity renormalization group (rapidity RG) [21, 22]. In the rapidity RG, divergences 
are regulated analogously to dim reg, using a scale u and infinitesimal parameter y that 

10 Note that outside this toy example, only the logarithms of A in full QCD will be reproduced by the soft 
limits, instead of the full A < 1 result. 
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correspond to the usual y, and e in dim reg. At one loop, these regulator factors are [21, 22] 
1 soft: = (|-) V = (-^) V’-'V’', 

2soft: ^ (i) 

1, 2 soft: = (-^) ?? s _r7//2 |2sinhy| _T ', (3.7) 

where s = and Ej is the total jet energy such that 2 * = Ei/Ej. These regulators give 
well-defined terms for each soft limit, 

1 soft: I sc ( A) = 


2 soft: I cs ( A) = 


1, 2 soft: I SS (A) = 


Now the sum of contributions is independent of the rapidity regulator and matches the full 
QCD results for A < 1: 

|/sc (A) + Ics( A) + /„(A)] 0(A < 1) = ^2 5(A) - A (A) 0(A < 1) + 0(e, rj) = / fu ii( A) . 

(3.9) 

Comparing Eq. (3.9) to Eq. (3.6), we see that with the rapidity regulator, the full result 
is reproduced by the sum of single- and double-soft contributions, instead of the difference. 
This happens because the single- and double-soft limits produce canceling poles in the large 
positive and negative rapidity regions. For example, the large rapidity limit is allowed in both 
the sc and ss contributions, where the rapidity regulators scale as 

1 soft: R^ c (y —> oo) ~ e y7J , 

1, 2 soft: R s v s (y —* oo) ~ e~ yv , (3.10) 

which lead to canceling poles: 

f °° 1 

1 soft: / dye yv = -, 

Jo V 

1, 2 soft: f d ye~ yn = - . (3-11) 

Jo V 


r <iZ ' f dZ2 (z 1 z 2 )-^8(A^z 1 z 2 )R s ri c 
Jo 21 Jo z 2 1 

f 1 — r — {z l z 2 )-^8{A-z 1 z 2 )R c v s 

Jo Z 1 Jo z 2 

r — r—( z i z 2 )- 2£ 8( a ~ zi z 2 ) ^ 

Jo Z 1 Jo 22 ' 




~ + 0(y) 
V 


(3.8) 
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Thus the cancellation of the large rapidity regimes happens at the level of the sum of contri¬ 
butions, instead of the difference. 

Not only does the rapidity RG make the various soft contributions well defined, but it 
also separates the double-soft limit from the zero-bin subtraction (needed to remove double 
counting [65]) of the single-soft limits . 11 Because the rapidity regulators in the single- and 
double-soft limits are not related by scaling in rapidities, the double-soft contribution is 
not the zero-bin of the single-soft limit. In fact, the zero-bin limit is scaleless; the rapidity 
regulator is unchanged in the zero-bin limit but now all rapidities are allowed, leading to the 
scaleless integral [ 22 ] 

/ OO 

dye ±yv = 0. (3.12) 

-OO 

In SCETn applications where the collinear and soft contributions to the distribution can be 
factorized into separated jet/beam and soft functions, this factorization allows these rapidity 
divergences to be divided into the collinear and soft functions and the resulting logarithms 
to be resummed using standard techniques (see, e.g., [ 21 , 22 ]). 

The features of rapidity divergences in this simple example of A repeat themselves in the 
calculation of near-integer behavior of Nj et (specifically, A 2 +) in Sec. 4.1 below. 


4 Calculating the Near-Integer Behavior 

As seen in Fig. 4, fractional values of !Vj et arise when radiation in a jet region extends beyond 
a radius R. In particular, near-integer values of A(j et come from soft radiation beyond R, and 
the deviation from integer values grows as emissions become harder. In this section, we study 
the leading-order near-integer behavior of _/Vj e t in e + e _ —> jets at a center-of-mass energy of 
Q, which first occurs for e + e - —> 4 partons. 

Let A 2 ± be the jet multiplicity near the 2-jet peak as defined in Eq. (2.3); we take the 
power counting A 2 ± ~ A 2 , with A < 1. We will first derive an analytic expression for the 
cross section at leading order in A (i.e. the singular contributions), including a discussion of 
the aforementioned rapidity-like divergences. We will then calculate the full C(a 2 ) result 
using the Monte Carlo program Event2 [66, 67], which allows us to include non-singular 
terms as well as cross check our results for the singular contributions. We will also calculate 
the 0 (a 2 ) contributions to A 3 _, though this is not the focus of our studies. 

4.1 Singular Contributions using Splitting Functions 

The cross section for A 2 ± can be written as 
da 


dA 2 ± 


= / d$ 4 f(e + e —> 4 partons) J r (A 2 ±, ^ 4 ), 


(4.1) 


11 In this case, the zero-bin limit is the overlap of the single- and double-soft limits, and is tantamount to 
removing the restriction on y in the single-soft limits in Eq. (3.5). 
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where $4 represents four-body phase space, T is the matrix element for e + e —> 4 partons, 
and J 7 (A 2 ±,$ 4 ) is the measurement function, which projects out the slice of phase space 


corresponding to a constant value of A 2 ±. The allowed 

values 

of 

lVj e t for four partons 

1 2 

are: 

Four isolated partons: 

^Vjet 

= 

4, 


(4.2) 

Two isolated partons and one merged pair: 

^Vjet 

= 

3, 


(4.3) 

One isolated parton and one merged triplet: 

^jet 

£ 

(2 

~(E cut /Q) 2 ,3), 

(4.4) 

Two merged pairs: 

^jet 

= 

2 . 


(4.5) 


The only non-integer behavior is obtained for the merged triplet in Eq. (4.4), which 
requires three particles to be within an angular distance < 2 R. Thus, for sufficiently small 
R, we can use the matrix element in the limit where three partons are collinear, allowing us 
to take advantage of collinear factorization [66-70] : 13 


T(e + e —> 4 partons) ~ T(e + e —> qq) ■ ^ 7fc CoI1 (l —> 3) 


= T(e + e —> qq) ■ 


(8ira s g 


2e\2 


T 23 




lU3} 5 


(4.6) 

(4.7) 


where k labels one of the parton channels q —> ggq , q —> ggq , q —> q'q'q , or q — > q'q'q , S 123 
is the squared invariant mass of the three parton system, and (P('i^ 3 ) is the spin-averaged 
1 —> 3 splitting function [76, 77]. The factorization in Eq. (4.6) implies that the relevant 
phase space for the collinear splitting is 3-body, meaning we decompose the 4-body phase 
space of the whole event into the 2-body qq system of the hard interaction and the 3-body 
phase space of the 1 —> 3 splitting. In the collinear limit, the 4-body phase space factorizes 
as d4>4 = d4?2 (qq) x dsi23/(27r) x d4>3(l — > 3). 

The collinear matrix elements can be further decomposed according to their color struc¬ 
ture. The channel q —> ggq (and q —> ggq) can be written as a sum of Abelian (“ab”) and 
non-Abelian (“nab”) contributions: 


T coll (q —> ggq) — CpT^ ll (q —> ggq) + CpCAT^^(q —> ggq), (4-8) 


while the q -» q'q'q (and q —>• q'q'q) channels will give contributions proportional to the 
CpTp color structure . 14 At O(a^) the cross section for A 2 ± is obtained by summing over all 
channels and color structures. In the rest of this subsection we will discuss the calculation of 

1J Here we assume that each isolated parton and group of partons are sufficiently energetic to pass the energy 
cut Scut. If this does not hold, then the integer values of JVj e t = 3 or 4 may be reduced to 2 or 3. 

1J We use spin-averaged splitting functions, which are sufficient since the value of JVjet for the collinear system 
is independent of its orientation relative to other jets in the event (see, e.g., [71]). This 1 —> 3 splitting function 
approach also appears in Ref. [72] for calculating the jet substructure observable planar flow [73-75]. 

14 For same flavor final state quarks q' = q, this matrix element contains also terms proportional to Cp and 
CfCa■ However, these terms do not contribute to the cross section at leading power. 
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the Abelian piece of the cross section; results for the other color structures can be found in 

App. A. 

The Abelian contribution comes from a q —> ggq or q —> ggq splitting. The two channels 
give identical contributions and we will focus on the q —> ggq case for definiteness, labeling 
the final state as: 

.91.9293- (4.9) 

As discussed in Sec. 2.2, at leading power we can take the soft limit of our observable, which 
corresponds to either one or both gluons becoming soft. As in the toy calculation in Sec. 3, we 
will label the limit where gluon 1 (gluon 2) is soft as sc (cs) and the double-soft limit as ss. 
We again use the energy sharing variables z% = Ei/Ej and scaled veto variable 2 cu t = E cut /Ej 
in Eq. (2.8), where here Ej is the energy of one jet region (with Ej ~ Q/2 for a collision at 
center-of-mass energy Q ). We generally assume 2 cut -C 1 , which allows us to neglect power- 
suppressed contributions from soft quarks, and if some z % < 2 cut then that particle may be 
treated as soft. 

In Tables 1 and 2 we collect contributions to A 2 ± in the relevant soft limits and regions 
of phase space. At this order, we also get a contribution to A 3 _ = 3 — IVj et , and we will carry 
out the calculation for A 3 _ as well. 

We now gather the relevant pieces to evaluate Eq. (4.1) in the collinear limit. There are 


three relevant regions of phase space: 

Ua = 0(013 > R) 0(023 < R) 0(012 < R), (4.10) 

lZ B = n A { 1**2), (4.11) 

-Re = 0(013 < R) 0(023 < R) 0(012 > R), (4.12) 

where 0,^- labels the opening angle between partons i and j. In Table 1 we show a schematic 
representation of the angular phase space regions and soft limits which are explicitly listed in 
Table 2. From Table 2, the measurement functions are given by 

^(A 2 _,ch4)=Tr 2 _, (4.13) 

^(A 2+ , $4) = n c 2+ + E£ + + TT 2+ , (4.14) 

^(A 3 _, < f> 4 ) = , (4.15) 

where 

■TaL = <5( a 2- - z x z 2 ) 0(^i + z 2 < z cut )TZ A + (1 2), (4.16) 

E£ + = 5 [A 2+ - Z! (1/22 - z 2 )] 0(22 > Z cnt )U A 
+ (5[A 2+ - 2l2 2 (2 - z 2 )/(1 - z 2 )\n c , 

^a 2+ =^a 2+ (1^2), (4.18) 

E& + = 5(A 2+ - 2212 2 ) n C , (4.19) 

^As_ = 5 (As- - 21 [1 - 22(1 - z 2 )]/ \z 2 (1 - 2 2 )]) 0(22 > z cut )H B , (4.20) 

=E^_(l^2). (4.21) 
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Observable 

TZa 

IZb 

TZ C 


gi ga qa 

ga/gixqa 


A'2— 

l *(•)• J 

1 •(•]• j 





gi/qa ga gi qa ga 

: • • • ;' • • • j 


f gi /ga\ qa \ 

( g2/gi\qa \ 

V X. ^ ' V >C y 

a 2 + 

( • • • j 

( • • • j 

gi qa ga 

• • • j 


gi ga qa 

g2/gi\q3 


A 3 - 

; • • • ; 

i • • ) 



Table 1. Representation of the phase space configurations contributing to the near-integer jet multi¬ 
plicities A 2 —, A 2+ , and A 3 _. Compared to Fig. 4, the value of 7Vj et is 1 unit higher, because the event 
contains an additional isolated parton (not shown). For each observable, we show the corresponding 
contributions from different angular regions and soft limits. Circles represent cones of radius i?, large 
blue dots represent energetic partons, small red dots soft partons with 2 < z cut . The angular regions 
7 Za, TZb, and TZc are defined in Eq. (4.10). 


Observable 

Region 

Expression 

Limit 

Cuts 

a 2 _ 

IZa 

z\ z 2 

1, 2 soft 

Z! + Z 2 < Z cut 

a 2 

IZb 

Z\Z 2 

1, 2 soft 

Zi + Z 2 < z cut 

a 2 + 

IZa 

2 i(l - zl)/z 2 

1 soft 

Z2 > Zcut 

a 2 + 

IZb 

Z 2 { 1 - zf)/zi 

2 soft 

Zl > Z cut 

a 2 + 

1Z C 

Z!Z 2 (2 - z 2 )/{ 1 - z 2 ) 

1 soft 

- 

a 2 + 

TZc 

ziz 2 {2 - zi)/{ 1 - Z\) 

2 soft 

- 

a 2 + 

TZ C 

2z\z 2 

1, 2 soft 

- 

A 3 - 

IZa 

Z 2 [ 1 - Zl{\ - Zl)]/[zi(l - Zl)\ 

2 soft 

Zl > Zcut 

A 3 - 

IZb 

Zl[l - Z 2 (l - z 2 )\/[z 2 ( 1 - z 2 )\ 

1 soft 

Z 2 > Zcut 


Table 2. Near-integer behavior of 7Vj e t, shown for various phase space regions as depicted in Table 1. 
In each case, the expression for the observable is given along with the relevant limits and phase space 
cuts. 
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The single- and double-soft limits for the 1 —> 3 matrix elements are 


%b(q -t gig2Q3 ) - 


'-TSC 

' ab 

< XS 

'-rss 
' ab 


4(47ra s /i 2e ) 2 2-2 z 2 + z%( 1 - e) 

E J @13&23 z l z 2(l ~ z ?) ’ 

T£(l o 2), 

8(47ra s /x 2<E ) 2 1 

E J ^13^23 z l z 2 


(1 soft, 2 collinear) 
(1 collinear, 2 soft) 
(1 soft, 2 soft) 


(4.22) 


where we have included a symmetry factor of 1/2! for identical gluons. Notice that in the 
double-soft limit, the Abelian matrix element simply reduces to the product of two eikonal 
factors. In the collinear regime, the 3-body phase space can be written as [78] 


E 


4—4e 
J 


d<]?o o11 = - 

3 (4vr) 4 - 2e r(l - 2e) 

where the angular phase space d4>o is given by 


dd> 2 , 


(4.23) 


d$o = d0 2 3 d0 2 2 3 d^ 2 0(A)A^ 1 / 2 - e , A = 4 9l 3 0 2 23 - (e 2 V2 - 0 2 3 - 0 2 3 ) 2 , 


(4.24) 


and the energy phase space dd> 2 is given by 


d4> 2 ~ 


dd>f = 0(2i < 00 ) 0 ( 2:2 < l)[ziZ2(l - z 2 )] 1 ^ 2e dzidz2, 

< d<I>f = d$f(1 <->■ 2), 

dd?* s = 0(2:1 < 00)0(2:2 < co){ziZ2) 1 ~ 2 e dz\dz2- 


(1 soft, 2 collinear) 

(1 collinear, 2 soft) (4.25) 
(1 soft, 2 soft) 


As discussed in Sec. 3, for certain contributions to the observable, these energy integrals 
give rise to rapidity divergences which require special regularization to make each term well 
defined. We use the rapidity regulator to do this, and the sum of all contributions is regulator 
independent. Specifically, the contribution to A 2+ from 7 Zc is the only one with both single- 
and double-soft limits (see Tables 1 and 2), and hence the only contribution which requires 
this extra regulator. While A 2 — does receive contributions from the double-soft region of 
phase space, the constraints on the gluon energies from 2 : cu t implies that both gluons must 
be soft, and hence the single-soft limits do not contribute. The limit on the gluon energies 
also implies there is a kinematic limit on the observable, A 2 - ^ (Zcut/2) • 

By combining the measurement function with the proper limits of the matrix elements, 
we get an analytic expression for the Abelian contribution to the A 2 ± and A 3 _ distributions 
at leading power in A. Including the identical contribution from q —> qgg yields 


da 

dA2_ 

da 


J ab 


dA 


2+J ab 


da 

dA^I 


ab 


C 2 (^) 2 Xo{-2£i(A2-) + 41n2 cut £o(A2-)}0(A 2 - <2 c 2 ut /4), (4.26) 

C 2 F ^) 2 J f2 |-^£ 1 (A 2+ ) + + y hr 2 - 21nz cut )x 0 (A 2+ )} , (4.27) 

Cl (^) 2 X n (-^ -21n2 cut ^A)(A3-), (4.28) 
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where the distributions C n are the usual logarithmic plus distributions, defined in App. B. 
The integral over the angular phase space is given by 15 


_ 4 f d<£>^ _ 57r 2 

- w ^ 2 y54 • 


(4.29) 


Eqs. (4.26), (4.27), and (4.28) represent the leading order distributions in the small R 
and Zcut limits. Notice that at this order there is no dependence on R, as the 1 /Ofj factors 
in the denominator of the angular integral provide a logarithmic scaling which can be used 
to explicitly remove the R dependence. Since we are focused on the non-integer behavior of 
iVjet, we have suppressed contributions proportional to 6(A n ±) in our results (see, however, 
Eq. (5.5)). 

The calculation of the CfCa and CfTruj contributions are very similar to the Abelian 
case, albeit more involved as the matrix elements do not simply factorize into separate angular 
and energy functions as in Eq. (4.22). In App. A, we give the full set of results for the different 
color structures. 

In the 0(o!g) calculation performed above, double logarithms of A n ± (appearing as 
£i(A n ±)) arose when both emitted gluons became soft, and single logarithms (£o(A n ±)) 
arose when a single gluon became soft. This correspondence holds true with more soft emis¬ 
sions: each soft emission results in a single logarithm of A n ± (in certain regions of phase 
space). Therefore, in general the largest logarithm at 0(a(?) appears when all emissions are 
soft gluons, and the contribution to the cross section is of the form a k s Ck~ i(A n ±). 


4.2 Nonsingular Contributions from Event2 

To go beyond leading power in A, we need to incorporate the Ota^) matrix elements from 
full QCD. For fixed-order calculations of e + e~ —> partons at low multiplicity, the program 
Event2 is a particularly useful tool [66, 67]. Event2 performs next-to-leading order cal¬ 
culations of e + e _ -* 2 and 3 partons, so it includes the needed e + e~ —> 4 parton tree-level 
matrix elements. Conveniently, it allows for the decomposition of its results order-by-order 
in a s and by color structure. Crucially, Event2 can probe the far infrared regions of phase 
space, meaning the (singular) logarithmic terms of A 2 ± and A 3 _ are enhanced. This allows us 
to perform a robust comparison and cross check of our splitting function calculations above. 
Having verified the singular logarithmic contributions to A 2 ± and A 3 _, we can then extract 
the nonsingular contributions directly from Event2. 

In Fig. 6, we show the O(a^) contributions to A 2 ± and A 3 _ extracted from Event2. 
We plot the coefficients of the (a s /2 tt) 2 C terms in the cross section as a function of In A n ±, 
where C is the relevant color factor (Cp, CfCa , or CfTfUj). To enhance the logarithmic 
contributions and minimize the power corrections in R and z cu t, we choose the small values 
R = 0.1 and z cut = 0.02. Plotted this way, double logarithms (£i(A n= |-)) appear as lines 

15 We have written the common angular coefficient In in terms of the phase space region 1Za- The same 
angular coefficient appears for regions 1Za, TZb, and 7 Zc due to an unexpected symmetry in the angular 
integrals. This symmetry is only present for certain angular integrals in the non-Abelian case. 
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0(a;) contributions to /V jet in full QCD 



Figure 6. The A 2 ± and A 3 _ distributions extracted from Event2. Shown are the separate Cp , 
CfCa , and CpTpUf contributions to the cross section, plotted as the coefficient of (a s / 27r) 2 C, where 
C is the relevant color factor. Note that the A 2 _ and A 3 _ axes run backwards. 


O(af) contributions to /V jet with logs subtracted 



Figure 7. Same as Fig. 6, but subtracting our calculations for the singular contributions in each 
region. The residuals vanish in the logarithmic (A„± —» 0) regime, indicating that the only remaining 
terms are nonsingular. 
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of constant non-zero slope, 16 single logarithms (£o( A n-t)) appear as lines with zero slope 
and non-zero offset, and nonsingular contributions vanish as A n ± —> 0. It is clear that 
logarithmically-enhanced terms are indeed present in the full QCD result from Event2. 

To make sure the logarithmic behavior from Event2 matches our analytic calculations 
in Sec. 4.1 and App. A, we can extract the nonsingular contribution to the cross section, 
which are the residual fixed-order terms after the logarithmic contributions are subtracted: 


d<7 ns dcTfuii dcr s i n g 

dA n -j- dA n -t- (\A n . 


(4.30) 


These are shown in Fig. 7, again separated by color structure, and confirm that our splitting 
function calculation, which includes the leading contributions in a small R and 2 cu t expansion, 
captures the leading-order near-integer behavior of JVj e t correctly. 1 ' 

For JVj e t < 2, the singular terms tend to dominate over the nonsingular ones. The reason 
is that the A 2 - distribution has a kinematic endpoint at (z cu t/2) 2 < 1 at this order, so the 
distribution only has support in the singular region of phase space. 

For 2 < JVjet < 3, there is an ambiguity in determining the nonsingular terms in the cross 
section, related to the fact that singular terms exist at multiple points in the JVj e t spectrum. 
In this range, there are singular terms from A 2 + that dominate near JVjet = 2 and singular 
terms from A 3 _ that dominate near JVj e t = 3. In Fig. 7, we defined the nonsingular term by 
removing the singular terms from the nearest singular point in the JVj e t distribution, using 
JVj e t = 2.5 as an arbitrary dividing line. 18 Alternatively, we could define the nonsingular term 
by removing both sets of singular terms over the entire spectrum. 19 This is a valid approach 
as well, since the singular A 2 + and A 3 _ distributions are governed by different soft limits of 
phase space, hence there is no double-counting by including both singularities. 

Since we are mainly interested in describing the behavior in the vicinity of JVj et = 2, 
though, for the rest of the paper we will simply adopt the nonsingular definition in Eq. (4.30), 
even in the vicinity of JVj et = 3: 


da„ 


dJV; 


-(2 < JVjet < 3) 


jet 


d(Tns 

dA2+ 


dcr f u ii d(T s i n g 

d JVj e t dA 2+ ' 


(4.31) 


16 Note that these are not Sudakov double logarithms, since they appear at (b(a 9 ), not 0(a s ). 

11 Power suppressed terms of the form 0(R 2 , z cu t) £ 0,1 (A n ±) are generically present in the nonsingular terms, 
though their contributions are negligible in Fig. 7 since we have chosen both R and 2 cut small. 

18 This corresponds to the nonsingular definition 


^(2 < JVjet < 3) 
d JVjet 


do-full 

d JVjet 


do" s i n g 

dA2 + 


e(JVj et < 2.5) 


+ 


do" s i n g 

dA 3 _ 


©(JVjet > 2.5) 


9 This would correspond to the nonsingular definition 


d<Jns (2 < JVjet < 3) = d,Tfu11 


diVi 


jet 


diVi 


jet 


da s i n g 

dA2+ 


do" s i n g 

dA 3 _ 


- 22 - 






















singular / nonsingular decomposition at 0 (a^) 



Figure 8. The 2 < iVj et < 3 distribution in full QCD (black, solid), decomposed into singular 
contributions from A 2 + and A 3 _ (blue, dashed/dot-dashed) and residual nonsingular contributions 
(red, dotted). The coefficient of (a s /27r) 2 is plotted for each contribution. Note that the sum of 
singular contributions is numerically dominant over the entire range. 


That said, this non-singular term turns out to be dominated by the singular A 3 _ piece. 
Writing the nonsingular term as 

doTs _ d(7 s i n g | d(T res ^ A 9r) ^ 

dA 2 + = dA 3 _ + dA 2+ ’ 1 ' ’ 

the residual term is quite small, even in the vicinity of JVj e t = 2.5. This is shown in Fig. 8, 
where the full QCD result between 2 < JVj e t < 3 is decomposed into the A 2+ singular, A 3 _ 
singular, and residual terms for R = 0.4 and z cu t = 0.2. The fact that the A 2+ nonsingular 
term is nearly saturated by the A 3 _ singular term suggests that higher-order logarithmic 
terms can play an important role in determining the shape of the Aj et distribution, even for 
large deviations from integer values of the observable. 


4.3 Complete Results to Order a 2 

Summarizing the results from Secs. 4.1 and 4.2, the full Oia^) result for the near-integer A 2 ± 
distributions can be written as 


da 

dA 2 -t 


du 

dA 2 -t 


+ 


- ab 


dcr 

dA 2 ± 


+ 


nab 


da 

dA 2 ± 


+ 


- C F T R rif 


do~ns 

dA 2 ± 


singular 


(4.33) 


- 23 - 






















The first three terms in Eq. (4.33) are the singular contributions to the cross section. The 
Abelian terms are given in Eqs. (4.26) and (4.27), while the contributions of non-Abelian 
and CpT^rij color structures can be found in App. A. The last term in Eq. (4.33) is the 
nonsingular contribution that we extract numerically from Event2 as discussed above. 

As discussed in Eq. (4.31), the entirety of the A 3 _ singular distribution in Eq. (4.28) has 
been absorbed into the A 2 + nonsingular distribution. Thus, when we plot A 3 _ in Sec. 7, we 
are in effect plotting the prediction for A 3 _ = 1 — A 2 +. 


5 Towards a Factorization Theorem 


In the previous section, we have seen that soft and collinear matrix elements govern the 
near-integer behavior of lVj et in fixed-order QCD. In this section, we explore the all-orders 
behavior at and near Aj e t = 2, building a candidate factorization theorem to describe the 
logarithmically-enhanced contributions to the cross section. 

Near A r j et = 2, energetic collinear modes must be confined to two jet regions of diameter 
R, such that no two energetic particles are more than R apart. This suggests that the collinear 
radiation may be described by jet functions. Additional radiation must be sufficiently soft 
so as not to create an additional jet, meaning there is a local veto of size z cnt outside of the 
primary jets. This suggests that additional radiation may be described by a soft function. 
However, while one might hope that a factorization theorem of the form 


(j(A2±) ~ do Hqq J q ( A 2 ±) <8> Jq( A 2 ±) <8> S qq ( A 2 ±) 


(5.1) 


would hold, it fails on several fronts. First, this standard factorization picture is challenged 
by both the non-additivity of the observable and soft-collinear non-factorization (discussed 
in Sec. 2.3 and examined in more detail below). Additionally, the logarithms of A 2 ± are 
non-global, and may not be summed with the above factorization into jet and soft functions. 
Furthermore, the cross section at A r j et = 2 (A 2 ± = 0) behaves similarly to a dijet cross section 
from a standard discrete jet algorithm, for which a convolution structure does not apply. 

We will discuss these issues below, en route to a candidate “local” factorization theorem 
which applies in the small R limit. This factorization theorem will take the form 


dcx 

dA2± 


= cr (IVjet = 2) 


Cq(A 2 ±) <8> C'g(A2±) , 


(5.2) 


where cr (A r j et = 2) is the cross section exactly at A r j et = 2 (see Sec. 5.1) and C q ^ are “collinear 
functions” (see Sec. 5.3). This form satisfies a number of plausibility checks, but a formal 
proof of its validity is beyond the scope of the present paper. 


5.1 The Cross Section at N- ]et = 2 

We begin by considering the cross section at exactly Aj eti = 2. At 0(a s ), the constraints to 
obtain Aj e t = 2 are the same as for obtaining two jets from a discrete jet algorithm, namely 
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that either two of the three partons must lie within a mutual radius of R or one of the partons 
must be below E cnt . At leading power, the phase space restrictions on the qqg final state are 


&{6 qg <R) + Q(d qg <R) + @(9 qg > R)Q(9 qg > R)Q{z g < z cut ). (5.3) 

At higher orders, the phase space restrictions differ from discrete jet algorithms (which 
also differ among themselves), and the lVjet = 2 cross section has unique features. Consider 
an 0(a! 2 ) configuration contributing to the near-integer behavior, as discussed in the Sec. 4.1. 
A concrete example is the A 2 +-R-C configuration in Table 1, which has a collinear q —>• qg 
splitting with another soft gluon (labeled s) emitted such that 


@qg ^ R , 6 qs > R , 6 gs <C R . 


(5.4) 


This event will give a non-integer contribution to N- ]e t, and its contribution to the cross section 
is of the form 


r^j 


a„ 


^2<j(Ajet — 2) + ^<5(A(jet — 2) + Kq £o(A2+) + Hi Tl(A2+) 


(5.5) 


The plus distribution terms were already calculated in Eq. (4.27). The <5(A2+) terms were 
suppressed in our previous discussion (since we were focused on the non-integer behavior), 
but both the 1/e 2 term and the </ term follow directly from the calculations in Sec. 4.1. 

The structure of Eq. (5.5) reflects the hybrid jet algorithm/event shape behavior of lVj et . 
The plus distribution terms have no support at lVj et = 2, since the plus function prescription 
removes any divergence there. 20 This feature is similar to other event shape variables with a 
singular limit, such as thrust, where there is zero cross section in the singular limit at higher 
orders. By contrast, the delta function terms behave more like a jet algorithm, as anticipated 
in Eq. (2.4), with a finite cross section at exactly lVj et = 2. The IR pole from the real radiation 
at A(j et = 2 will cancel divergences from virtual matrix elements, such that the lVj e t = 2 cross 
section is IR finite. 21 The additional finite term at lVj e t = 2 implies that the non-global 
structure in the near-integer part of the cross section is also contributing at lVjet = 2. 


5.2 Soft-Collinear Non-Factorization 

The non-global nature of the near-integer behavior of !Vj e t is suggestive of a non-standard 
picture of factorization. In fact, the lVjet observable does not obey soft-collinear factorization, 
meaning separate soft and collinear functions cannot be easily (or usefully) defined. This was 
foreshadowed in Sec. 2.3 and can be illustrated with a simple example at 0(ct 2 ). 

20 To see this, we can use the definition of the plus function in Eq. (B.l). Integrating [g(a:)] + against 5(x) 
gives the value at 0, 

[?(0)]+ = J dx fe(*)]+ = 0 • (5-6) 


21 It is straightforward to see that at (b(a 2 ) the cross section at lVjet = 2 must be IR finite. Since non-integer 
values of lVjet , as well as lVjet = 3 and 4, have IR finite cross sections, and since the inclusive cross section is 
IR finite, the remaining piece, the cross section at lVj e t = 2, must be as well. 
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Consider the same phase space configuration as Eq. (5.4), which has a single soft gluon 
and a pair of collinear (energetic) partons. The soft gluon is in the region of phase space where 
it is in the jet region of only one of the collinear partons, and the value of the observable is 
dependent on this fact. This implies that the observable receives contributions that intrin¬ 
sically depend on the soft and collinear modes in a non-factorizable way; the measurement 
function cannot be separated into separate soft and collinear pieces that do not depend on 
each other (see Fig. 4). Therefore, soft-collinear factorization does not hold. We note that, 
unlike other cases where soft-collinear factorization is not straightforward (see, e.g., [63, 79]), 
here the leading non-integer behavior does not factorize. 

Because the non-global structure of A 2 ± feeds into the IVjet = 2 cross section through the 
£ term in Eq. (5.5), the same non-factorization is also true of the integer value. Of course, 
for the exact IVjet = 2 case, non-factorization of the soft and collinear modes happens only 
at O(a^). Therefore, one can write the IVj et = 2 cross section as the sum of terms, one 
with global contributions that can consistently be resummed and one with non-factorizing 
contributions. Such a form is 


(TVjet — 2) — (TQHqq^Q , /l) Jq(Q , R, ^cut, M) Jq(Q j 7?, £cut; ft)Sqq(R, ^cut, /^) 

(Q, R , ^cut; AO • 


i lion—IVic / 
+ <J 2 


(5.7) 


The factorized part of the cross section is similar to dijet (or, generally, exclusive jet) cross 
sections. In e + e _ collisions, such cross sections are known to contain non-global logarithms 
that spoil a standard effective theory picture of the dynamics. These non-global logarithms 
span the jet and soft functions, affecting the RG evolution in nontrivial ways. Similar to the 
non-factorizing terms, the non-global contributions start at O(a^). 


5.3 Introducing Collinear Functions 

Let us now try to simultaneously describe the near-integer and exact-integer behavior of 
IVj e t ■ We have established that the logarithms of A 2 ± are purely non-global, and there is no 
standard soft-collinear factorization. As mentioned in Sec. 2.3, however, the contributions 
from each jet region are independent, meaning they can be separated: 

——~ er 0 C' 9 (A 2 ±) < 8 > Cq(A- 2 ±) , (5.8) 

dA 2 ± 

where oq is an overall prefactor and <S) refers to the standard convolution in Eq. (B.6). Here, 
we have introduced “collinear functions” C q and Cq for the two jet regions, which give the 
contribution to A 2 ± for each jet region separately. 

The collinear functions contain only the singular terms and have the general form 


n—1 


Cq,q{ A 2 ±) - <5(A 2= t) + y, £fc(A2+) + /4"_ £fc(A 2 -) 


n=2 


k =-1 


(5.9) 
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where are coefficients which in general depend on z cut (except for the leading coefficient 
K ri-] ±)- Recall that £_i(x) = 5(x), and these delta functions are needed to describe the 
iVj e t = 2 cross section. Since A 2 ± describe different regimes of the same observable (Aj et just 
above/below 2 ), each term in the expansion has support for one term or the other. 

The convolution between the collinear functions in Eq. (5.8) mixes the distributions for 
A 2 + and A 2 - • Therefore, we require not only the usual convolutions between distributions of 
either A 2 + or A 2 - (which we refer to as convolutions of one-sided distributions, e.g. £o(A 2 +)< 8 > 
£ 0 (^ 2 +))) but also convolutions between distributions of A 2 + and A 2 - (which we refer to as 
convolutions of two-sided distributions, e.g. £o(A 2 +)< 8 >£o(A 2 -)). Details about the definitions 
of one- and two-sided distributions and convolutions between them are presented in App. B. 

We can now reinterpret the calculation in Sec. 4.1 directly as a calculation of C q and C q 
to 0(o!g) (see App. C). The convolution between C q and C q in Eq. (5.8) then gives part of 
the higher-order £>(af) cross section. In addition, we can estimate the 0(aq) structure of an 
individual collinear function by performing naive (Abelian) exponentiation to capture some 
of the multiple emission terms. That is, we assume that each of the collinear functions in 
Eq. (5.9) has the form 


C q , q ( A 2± ) = <5(A 2 ±) + (^M 


2 


/c + (A 2+ ) + /c_(A 2 -) 



aJjj) 

7T 


/c+(a 2+ ) + /c_(a 2 _) 


/C + (A 2+ ) + /C_(A 2 _) , 


(5.10) 


where 


£+(A 2+ ) = «gJ.£ 0 (A2+) + «^£i(A 2 +) , (5-11) 

/C_(A 2 _) = 4 2 1^o(A 2 -) + 4 2 1Ai(A 2 _) . (5.12) 

The convolution terms at 0(af) are those coming from naive Abelian exponentiation in one 
jet region. Note that this does not fully capture the correct higher-order terms (for example, 
we do not get any term at O(a^) from this exponentiation), but we will see in Sec. 7 that it is 
enough to reproduce some of the higher-order effects observed in parton shower Monte Carlo 
generators. In defining Eq. (5.10), we have used the fact that one can absorb corrections to 
the Aj et = 2 cross section into the <7o prefactor (see further discussion below). 


5.4 A “Local” Factorization Theorem 

Via Eq. (5.8), we can capture the impact on Aj e t of soft and collinear emissions within the 
two jet regions. But soft emissions away from the jet regions can still give logarithmically- 
enhanced contributions to the cross section, even if they do not change the value of Aj et . For 
example, a soft gluon well-separated from the jets with energy fraction less than z cu t will not 
change Aj et , but it will contribute to the cross section. For e + e _ —> qqg , this wide-angle soft 
radiation is part of the Aj et = 2 phase space. Going to higher orders, there is a contribution 
to both the exact-integer and near-integer cross sections from wide-angle soft radiation, and 
up to power corrections for small R, both contributions are identical. 
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This logic implies that the JVj e t = 2 cross section should multiply the near-integer con¬ 
tributions, leading to the candidate factorization theorem from Eq. (5.2), repeated for conve¬ 
nience: 


da 

dA 2 ± 


= cr(iVjet = 2) 


C q (A 2± ) ® C q (A 2± ) 


(5.13) 


This is the same structure as Eq. (5.8), but we have identified a o with a(Nj et = 2). As 
discussed in Eq. (5.7), cr(Aj e t = 2) itself has its own quasi-factorization theorem. If this 
candidate factorization theorem is indeed true, then the definition of C q , q ( A 2 ±) in Eq. (5.10) 
should be revised such that the coefficient of the (5(A 2 ±) piece is always 1 (i.e. the sum over 
k should start at 0 instead of —1). 

We stress here that Eq. (5.2) is only a candidate factorization theorem, and we have not 
proven that such a form exists. In particular, we do not have an operator definition of the 
collinear functions C qtq , and without such a definition, the extraction of C q _ q is ambiguous 
from fixed-order calculations alone. The reason is that the structure of C g (A 2 ±) <8> C q ( A 2 ±) is 
single-logarithmic order-by-order, without any definite relation between coefficients, so higher- 
order terms can absorb corrections from lower-order ones. For example, the leading nontrivial 
terms in cr(IVj et = 2) (the 0(a s ) term) and C q ( A 2 ±) <g> C q ( A 2 ±) (the 0(a 2 s C i) terms) will 
contribute to (the ©(a^) £i(A 2 ±) terms), but cannot unambiguously determine that 
coefficient. 

Despite these limitations, our candidate factorization theorem does describe important 
effects (like convolutions between the two jet regions) that go beyond a simple perturbative, 
log series expansion, which is why we will use Eq. (5.2) in our comparison studies in Sec. 7. 
One thing we can say unambiguously is that if the factorization theorem in Eq. (5.2) is 
valid, then the collinear functions must be RG independent. The reason is that the prefactor 
cr(A r j et = 2) is itself a cross section, so it must be RG independent, and therefore C q <S> C q 
must be RG independent. Similarly, the modes that contribute to C q and C q are completely 
disjoint, so there is no possibility that RG-scale-dependence could cancel between C q and C q . 
This does not rule out, however, a further factorization of the collinear functions. 


5.5 Complete Results 

Let us summarize our final prediction for the A 2 ± distributions using the candidate factor¬ 
ization theorem in Eq. (5.2). Beyond the calculation summarized in Sec. 4.3, we can include 
two higher-order effects. 

First, as described in Sec. 5.3, the collinear function approach allows us to capture 0(c4) 
terms coming from convolutions. There are two types of convolutions: convolutions between 
C q and C q given by the factorization theorem, and convolutions within an individual collinear 
function coming from naive Abelian exponentiation. Though the naive exponentiation can in 
principle give results to all orders in a s , we will truncate the convolutions to order O(o^), 
especially since there are known C^ccf) terms missed by this approach. 

Second, we can include running coupling effects by evaluating a s {(x) at an energy scale 
p, = QyJ A 2 ±. To see why this is the correct scale, note that in Sec. 4.1, the fixed-order 
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expansion in dim reg had a prefactor of 




e 2 a 2± 


2e 


(5.14) 


where Ej ~ Qj 2 is the jet energy. This implies that at higher orders, the fixed-order calcula¬ 
tion generates terms like \n(n 2 /E 2 A 2 ±) where ji is the renormalization scale, suggesting that 
QyJ Ao± is a relevant running coupling energy scale. 

As in Sec. 4.3, the nonsingular contributions to the cross section enter at fixed 
and can be simply included additively. Like in Sec. 4.3, we absorb the entirety of the A 3 _ 
singular distribution into the A 2 + nonsingular distribution. To try to capture some higher- 
order effects in A 3 _, though, we make use of the nonsingular decomposition in Eq. (4.32), 
repeated for convenience: 

d(Tna dcTsine (1 (7 res 

(5.15) 


del; 


sing 


+ 


day, 


dA2+ dA3_ dA2+ 

We evaluate the singular A 3 _ piece at the scale fi = Q^/A^Z and the small A 2 + residual 
term at a fixed scale /r = Q. While this approach misses out on genuine fixed-order 

corrections from 3-jet events, they have an endpoint at A 3 _ = (z c u t/2 ) 2 analogous to Eq. (4.4), 
so we will simply not make a prediction for A 3 - < (■Zcut/ 2 )". 

Finally, while the singular terms for A 2 ± come with an overall prefactor of a(Nj e t = 2 ), 
the nonsingular terms do not. However, since the nonsingular terms start at 0(a 2 ), we can 
multiply them by a(Nj et = 2)/gq, which introduces corrections beyond the order to which we 
are working. This allows us to factor out a global cr(IVj et = 2) from our final predictions. 

Putting these pieces together, our final analytic prediction for the A 2 ± distribution is: 


d<7 

dA2± 


= a 


(Ajet = 2 ) |c , q(A 2 ±) ® Cq(A 2 ±) 


+ 


1 doy 


= c(Aj e t = 2)|<5(A 2= t) + 2 

'a s (QV A 2 ±)\ 4 


00 dA 2 ± 

Ols(Q\/A 2±)\ 2 


(5.16) 


7 r 


/c + (A 2+ ) + /c_(A 2 -) 


+ 2 


+ 


7T 

1 do'ns ] 


/C+(A 2+ ) + /C_(A2-) ® /c + (A 2+ ) + /c_(A 2 -) 


00 


dA 2 ± J 


In Fig. 9, we compare the pure fixed 0(a 2 ^ distributions for A 2 ± and A 3 _ from Eq. (4.33) 
with the final prediction from Eq. (5.16). At small values of A„-t, the dominant differences 
come from the higher-order logarithms included in our final prediction, while at larger values 
of A n ± the running effects from our A n= j--dependent scale choice give most of the difference 
from the fixed-order prediction. Since we do not have a candidate factorization theorem for 
the JVjet = 3 region of phase space, the A 3 _ distribution is really just Eq. (5.16) evaluated at 
A3- = 1 — A2+. 
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Comparison Between Analytic Predictions 



Figure 9. A comparison between analytic predictions for the 7Vj e t distribution. Shown in dashed 
black are the fixed 0(a calculations from Eq. (4.33). Shown in solid blue are our final predictions 
from Eq. (5.16), which include an estimate of higher-order G(a]|) terms as well as A 2= |--dependent 
running of a s . Note that the A 2 _ and A 3 _ axes run backwards. 


6 Looking Towards the LHC 

Given the interesting features of JVj e t and the wide-ranging and robust measurements of jet 
substructure at the LHC [80-82], a natural consideration is the Aj e t distribution in hadronic 
collisions. In this case, it is more convenient to use the original definition of IVjet from Eq. (1.1) 
based on transverse momenta and rapidity-azimuth distances (instead of energies and angles): 

A(jet(PTcut, R) = V PTl Q (PTi,R - PTcut ), (6.1) 

i^ en tPTi,R 

where p TlJ{ = YljPTj &(R ~ Rij) and R %3 = ^ A y}- + A0?.. 

At first glance, the calculation of IVj et at the LHC would seem to be much more difficult 
than at an e + e _ collider. After all, IVj e t depends sensitively on soft radiation, and soft QCD 
is notoriously complicated at a hadron collider. However, IVjet only depends on soft radiation 
in a region of size < 2 R around energetic partons. To the extent that we can exploit color 
coherence at small R, we can make predictions for Nj et by simply knowing the collinear 
functions for quark- and gluon-initiated jets, C q and C g . In particular, the 1 —> 3 splitting 
function formalism that we used in Sec. 4.1 to calculate the collinear function for quarks in 
e + e _ may also be used to calculate the quark and gluon collinear functions for LHC processes. 

Conveniently, when switching between the e + e _ and hadronic definitions of IVj e t, the 
collinear functions C qt3 only differ by terms of (D(R). For small jet radii, one can therefore 
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neglect those corrections and use the e + e _ collinear functions calculated in this paper also 
for the hadronic case, though one has to be careful to match the z cu t value to the outgoing 
parton momentum. To see why this is the case, notice that 


z, = §-^^[l+0(R)}, 

Aj Ptj 


£cut 

■2-cut — . - — 

Aj 


PTcut 

PTJ 


[1 + 0(R)}, 


( 6 . 2 ) 


where ptj is the scalar sum of transverse momenta in a jet region. Thus, the energy integrals 
in Cq )9 would differ at most by O(R) terms. The angular phase space regions in Eq. (4.10) 
would be written as constraints on Rij instead of dij , but notice that 


Rij = Oij (cosh yi coshr/j) 1 / 2 ~ 9ij coshyj[l + O(R)], (6.3) 


where yj is some characteristic rapidity associated with the jet region, for example the rapidity 
of the summed jet region momenta. At leading power, the logarithmic scaling of the angular 
integrals can be used to remove any dependence on coshyj, so that the differences in the 
angular integrals are again at most an O(R) effect. 

In order to extend the candidate dijet factorization formula in Eq. (5.2) to dijet events 
from hadronic collisions, we have to sum over all relevant partonic channels: 

wt— = £ &hih 2 ^ke(Nj et = 2) Cfc(A 2 ±) <8> Q(A 2 ±), (6.4) 

dA 2± 


where k,£ = q, q, or g, and Ai j2 represents the colliding hadrons. In general, <Jhih 2 ^ke, Cfc, and 
Ci depend on the outgoing parton kinematics, though we have suppressed that dependence 
in Eq. (6.4) for readability. Note that the incoming beams only create additional high-p^ 
jets through hard, perturbative emissions. We can write the total rate ah^-^kii^jet = 2) 
schematically as 


»w?(A(jet — 2) — 'y ' 

a,b 


dx a dx b fa/h 1 ( x a)fb/h 2 ( x b ) cr a b^ki{N jet = 2), (6.5) 


where f a /h { and fb/h 2 are parton distribution functions for partons a and b carrying mo¬ 
mentum fractions x a b of the initial hadrons /ii >2 , and (j a b^ke .{^jet = 2) is the total rate at 
Ajet = 2 for the partonic channel ab ki. As in the e + e - case, this cross section can have 
large logarithms of R and PTcut that require resummation to obtain reliable predictions. 

Compared to the e + e _ case, the main new ingredient is the collinear function C S (A 2 ±) 
for a gluon-initiated jet. For completeness, we have calculated the gluon collinear function 
using the same approach as Sec. 4.1, and we present both C q and C g in App. C. The structure 
in Eq. (6.4) can be easily extended to handle the A n ± cross section for n-jet processes by 
using n collinear functions multiplied by the Aj e t = n cross section, appropriately summed 
over the various partonic channels. 

At a hadron collider, there is also a new potential source of non-global logarithms from 
initial state partons. In general, non-global logarithms of A n ± only appear in the collinear 
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functions C q ^ g for each final state jet, and are associated with correlated emissions from final 
state partons. For events with well-separated jets, the collinear functions are universal and 
independent of other jets in the event. However, there are also non-global logarithms of z cu t 
and R which appear both in the collinear functions and in the exact integer iVj e t cross section 
(e.g. <7/^2-^(-^Yjet = 2), see Eq. (5.7)). For the exact integer cross section at a hadron 
collider, one also has to include correlated emissions from initial state partons, which may 
introduce super-leading logarithms [83-85] at sufficiently high a s order. Ideally, one would 
want to understand the resummation of non-global effects in the collinear functions and the 
exact integer cross section to achieve accurate predictions. 

Because a detailed study for the LHC is beyond the scope of this paper, we will not 
present any results for hadronic collisions. An important effect to account for in future 
LHC predictions and comparisons with measurements is hadronization, which we have not 
considered here. Additionally, for hadronic collisions, effects from the underlying event and 
initial state radiation are not present in the e + e _ case. To lowest order in R, we expect those 
effects to be captured by color coherence, and one could imagine using the techniques of 
Ref. [86] to understand the R dependence. A similar concern is pileup contamination, though 
the closed-form nature of Aj e t makes it well-suited to analytic studies using area subtraction 
[87-89]. One could further mitigate the impact of pileup by using a version of Aj et that 
includes jet trimming [90] in closed form [15], and we expect the collinear function for the 
trimmed A r j et version could be calculated using the same techniques used here. 

7 Monte Carlo Comparisons 

We now compare our analytic prediction in Eq. (5.16) with parton shower Monte Carlo gen¬ 
erators. As discussed in Sec. 5.4, the rate er(Aj e t = 2) only enters as an overall normalization 
factor, and it in principle requires resummation of logarithms of R and z CVL t to obtain a reliable 
prediction. Since we are mainly interested in describing only the near-integer (and not the 
exact-integer) behavior, we will divide out by an overall normalization factor and perform 
only a shape comparison. In all of the plots below, we normalize the cross section to the 
region A 2 + E (10“ 4 ,10~ 2 ). 

To make an apples-to-apples comparison of our analytic results to Pythia 8 and Her- 
WIG-I —|-, we turn off hadronization in the Monte Carlo generators. In principle, the collinear 
functions C q and C q should get hadronization corrections, but the non-factorizing nature of 
Ajet means that we cannot adopt a standard shape function analysis [91, 92], We run at a 
sufficiently high energy to allow for a comparison over a wide logarithmic range in A 2 ±. The 
large collision energy we choose, Q = 10 TeV, also mitigates the effect of low energy cutoffs 
(~ 1 GeV) on the parton shower. 

Because the singular terms in our calculation are numerically dominant for small A 2 ±, 
higher-order logarithms may be important in determining the shape of the distribution in this 
regime. Some of the C>(af) effects are captured by the convolution structure in Eq. (5.16), 
but they are not fully reliable. Therefore, we include an uncertainty in our predictions derived 
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Figure 10. A shape comparison between our analytic predictions for the 2 < _ZVj et < 3 distribution 
with the Monte Carlo generators Pythia 8 and HerwigH — h Our calculation is summarized in 
Eq. (5.16) with uncertainties given in Eq. (7.1). To make an apples-to-apples comparison, we include 
showering but not hadronization in the Monte Carlo samples. All cross sections are normalized to 
have the same value in the A 2 + £ (10~ 4 ,10 -2 ) window. 


from the addition of 0(cKg£ 2 (A 2 ±)) and 0( a 4 s C 3 {A 2 ±)) terms with unknown coefficients that 
we vary. These terms represent the leading logarithms at each order, and are an estimate of 
missing higher-order terms. The form we use to determine the uncertainty is 

3 (Y)hw £ 2 ( a 2± ) + (Y)‘4« £ 3 ( a 2± ), (7.i) 

and we vary independently the coefficients in the ranges £ (—5, 5) and £ (—15,15). 

( 2 ) 

These coefficients are of similar size as the leading coefficients in our calculation, and 
the 0(ctf) term generated by the naive exponentiation in Eq. (5.16) has ~ 0(10). The 
uncertainty band is given by the envelope of these variations, including effects both on the 
cross section normalization (where we just add Eq. (7.1)) and on the cross section shape 
(where we add Eq. (7.1) and readjust the normalization in the A 2+ £ (10 ,10 -2 ) window). 

In Fig. 10, we show the shape comparison between our result and Pythia 8/Herwig++ 
over the range 2 < A r j et < 3. The singular cross section for A 3 _ is included as part of 
the nonsingular correction to the A 2+ distribution (see Eq. (5.15)). Overall, we find good 
agreement with the Monte Carlo generators within uncertainties. It is amusing that there 
is such close agreement with Pythia 8 in the lVj et —> 2 region and with HerwigH —\- in the 
iVjet —> 3 region, with the analytic result effectively interpolating between the two. Given 
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Analytic / Monte Carlo Shape Comparison 



Figure 11. Same distributions as Fig. 10, but in triptych form to highlight the near integer behavior. 
Because we normalize only to the A 2+ € (10~ 4 ,10~ 2 ) window, the normalization differences in the 
A 2 _ region are accentuated; the shape agreement is much better. For the A 3 _ distribution, we extend 
our central prediction (with a dotted blue line) to A 3 _ < (z cut /2) 2 , where genuine 3-jet events, whose 
contribution we have not calculated, contribute to the observable. Note that the A 2 _ and A 3 _ axes 
run backwards. 


that the A 2+ and A 3 _ parts of the distribution are dominated by different phase space 
configurations (see Table 1), it is possible that we are seeing the impact of different shower 
ordering variables in Pythia 8 (pj_-ordered) versus HerwigH — b (angular-ordered). Of course, 
the theoretical uncertainties in our calculation are too large to make a definitive statement. 

In Fig. 11, we compare distributions of A 2 ± over the range (10 —4 ,1), using the triptych 
format to also see the A 3 _ region. Again, our analytic prediction reproduces the shape of the 
Pythia 8 and Herwig-I— b distributions remarkably well over the whole range. The Monte 
Carlo distributions, which include higher-order logarithms of A 2 ± through multiple emissions, 
generally lie within the uncertainty band of our prediction for IVj e t > 2. This indicates 
that we have made a reasonable estimate of the higher-order corrections not included in our 
calculation. Because we normalize to the window A 2+ 6 (10~ 4 ,10~ 2 ), it is not surprising that 
there is a normalization discrepancy in A 2 _, though one can see that the shape agreement 
for IVjet < 2 is excellent. We suspect that the normalization issues for IVjet < 2 may be 
due to the treatment of correlated soft radiation (and angular ordering) in the Monte Carlo 
generators, since the A 2 _ distribution is dominated by double-soft emissions with a CfCa 
color structure. The overall shape agreement between Pythia 8 and HerwigH- + is quite 
surprising, given that they exhibit a factor of 3 difference (not shown) in their predicted value 
of cr(iVj e t = 2), suggesting that the shape of the non-integer distributions is more robust than 
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the cross section at integer values. 


8 Conclusions 

In this paper, we studied the analytic properties of iVj et , a jets-without-jets event shape that 
can return a fractional value of jet multiplicity. Focusing on e + e _ —> jets, we calculated the 
distribution of IVj e t in the vicinity of a dijet configuration at C(a^). A fractional number of 
jets requires at least three collimated partons, such that for e + e~ —> jets, the first non-trivial 
contribution requires at least two emissions. The singular parts of this emission structure can 
be captured using 1 —> 3 splitting functions, and we validated this approach (and included 
nonsingular contributions) using the fixed-order code Event2. To partially capture higher- 
order effects out to 0(cg)), we included convolutions from different phase space regions and 
running coupling effects. We found very good agreement between our calculation and Monte 
Carlo distributions from both Pythia 8 and HerwigH— h 

Fractional jet multiplicity exhibits unique analytic features that are not shared by other 
jet observables. At O(ct^), we showed how rapidity-like divergences, related to the energy 
sharing between emissions, appear when one or two partons become soft, and we explained 
how to regulate them. Beyond the fixed-order result, we proposed a candidate local factoriza¬ 
tion theorem and used it to predict a hybrid jet algorithm/event shape behavior for the IVj e t 
distribution in the vicinity of an integer. At exact-integer values, IVj et behaves much like a 
standard jet algorithm, yielding spikes in the IVjet cross section. The near-integer behavior of 
IVjet is more characteristic of event shapes, where towers of higher-order logarithms give rise 
to (single-logarithmic) suppression of the singular phase space, yielding shoulders in the IVjet 
cross section. Finally, as opposed to typical event shapes, collinear emissions do not generate 
logarithms of IVjet, so the shape of the near-integer distribution is entirely determined by 
soft logarithms. These soft logarithms are purely non-global, as near-iuteger values force soft 
partons to lie in a restricted angular region of phase space and correlate different emissions 
(even if generated from an Abelian matrix element). 

Beyond our present understanding of IVj e t , there are three key directions to pursue. The 
first is to extend our calculations to O(a^). Though the differential cross section shapes 
were largely within our uncertainty estimates, the normalization differences seen between our 
analytic calculations and Monte Carlo generators suggest that higher-order terms might be 
relevant. While we were able to estimate some effects through convolutions, there 

are genuine 0(a (?) effects that may get a phase space enhancement in the merged jet region 
(relative to the 0(a )?) phase space) to partially overcome the a s suppression. The second is 
to attempt an understanding of logarithmic resummation. Because of the non-global nature 
of the observable, standard renormalization group methods will not work, but there may be 
a way to exploit the rapidity-like divergences seen in Sec. 3 or the techniques introduced in 
Refs. [93, 94], The third is to perform a detailed study for hadron colliders like the LHC, as 
discussed in Sec. 6, including the important effect of hadronization. Fractional jets offer a 
more nuanced understanding of jet formation than is possible with standard jet algorithms, 
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and a measurement of A(j et seems feasible given the increasingly sophisticated approach to jet 
physics at the LHC. 
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A Results for Non-Abelian Contributions 


Following the Abelian results in Sec. 4.1, here we show the singular C F C A and C F T}inj 
contributions to the A 2 ± and A 3 _ cross sections. The CfCa terms are 
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where Iq is defined in Eq. (4.29), and the remaining angular integrals can only be done 
numerically: 


4 a) = -2.44393, 


= -0.035397. 


The C F T F rif terms are 
dcr 


dA 2-J C F T R n f 
da 
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CpT R nf 


= (^) 2 C F T R n / 4 c) £ o (A 2 _)0(A 2 _ < ^ ut /4) 
= C F T R n f X^C o(A 2+ ), 
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(A.6) 

(A.7) 


The C F T R nf contribution to A 3 _ is purely power suppressed. 
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B Properties of Distributions 


In this appendix we collect useful formulae for the convolution of one-sided distributions, 
and then discuss two-sided distributions and their convolutions. The relations given here are 
straightforwardly derived from the results and techniques in Ref. [95]. 

Throughout this paper, we use the standard plus distribution notation. For a function q, 


[?(*)]+ = l im j- [©(* - P)Q{x)\ , Q(x) = [ d x'q(x'). (B.l) 

/3->o ax J i 


This distribution has a boundary at 1, so that 


dx [g(x)]+ = 0 . 


(B.2) 


We also use the shorthand 

- , , 

C n [x) = 


0 (x) 


for n an integer > 0 , £_i(x) = 5(x ), (B.3) 


as well as the distribution 


1 _ _ 1 

[0(x) x~ 1+/3 ] = -S(x) + ^y/3 n £n(z) 

P n= 0 U ' 


(B.4) 


The [oo] notation on the plus distribution indicates that the boundary is at oo (instead of 1), 
so that 


[0(x) x 1+/3 ] |°°^ = 0. 


(B.5) 


B.l One-Sided Distributions and their Convolutions 


For many applications, one often makes use of convolutions between plus distributions, espe¬ 
cially C n . The convolution is defined as 

/ OO 

dx'd x" 6(x — x' — x") fix') g(x "). (B.6) 

-OO 

We can take the Fourier transform T to make the convolution multiplicative, and also use 
the Fourier transform and its inverse to determine a convolution: 


F{f®g) =-£{/}-£{<?} => f®g = J 7 ~ 1 {J 7 {f}J r {g}}. (B.7) 

It is straightforward to determine the convolution between two plus functions Ck and C n 
by relating it to Eq. (B.4): 

(£fc ® £ n ) (x) = the coefficient of ^ [©(x) x _1+Q ] <S> ^ [0(x) x _1+/3 ] . 

' " ' (B.8) 
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Using the Fourier transform 


,F{[©(x)x W/3]M} =r[#|(-is) p , 


(B.9) 


with s as the conjugate variable, it is straightforward to show 

v ,k on s 


(C k <S> C n ) ( X ) = the O coefficient of [©(») x 1+ “+^] ^ (B.10) 
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This approach is similar to the one used in Ref. [95] to give the general form of the convolution 
(£jb <8> C n )(x), where an equivalent result to Eq. (B.10) is given. 

We give the first few convolutions here for convenience: 


Cq{x) <g> Cq(x) = 2£i(x) - ^-<5(x), 

o 

3 7T 2 

Co{x) <g> Ci(x) = -C 2 (x) - —Cq{x) + C 3 ^(x) , 

2 4 

Ci{x) <g> Ci(x) = C 3 (x) - ' K -Ci{x) + 2C3 £o(x) - ■ 

3 3bU 

These distributions only have support for x > 0, and appear in many applications. 


(B.ll) 


B.2 Two-Sided Distributions and their Convolutions 


In this work, we encountered observables whose cross sections have support for all x and 
behave like distributions as x —> 0 . We will refer to them as two-sided distributions (one 
may think of the usual distributions as one-sided), and we now discuss convolutions of them. 

Consider an observable x with singular behavior for x —> 0^. The fixed-order singular 
behavior of x is described by distributions C n (x±), where x + = x for x > 0 and x_ = — x for 
x < 0, so that 


£-n(x + ) 

Cn(x-) 


£n(x) = 
CJ-x) = 


In” x 


0(x) 

X 

In n (-x) 
—x 


0 (-x) 


(B.12) 


Convolutions between various C n (x±) will mix the x± distributions. 

First, we note that convolutions between two x+ distributions or two x_ distributions 
are effectively one-sided, meaning the above results can be used without modification. Con¬ 
volutions between an x_|_ and an x_ distribution are the novel ones we derive here, adapting 
the technique in App. B.l to find the general form of two-sided convolutions. 

Using the Fourier transforms of the x± distributions 

^{[0(x ± )xi 1+ ^]f ] } =r(P)teis)-f > , (B.13) 
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we have 
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We can easily Fourier invert the right-hand side given Eq. (B.13). This implies 
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Using this relation, the first few nontrivial convolutions for two-sided distributions are 

7r 2 
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1 7 r 2 
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+ \^{ x ~) + y£i(.T_) + C 3 A)(z-) + Y^ 5 ( x ) • ( B - 17 ) 


C Quark and Gluon Collinear Functions 

We summarize here results for the collinear functions for quark- and gluon-initiated jets 
through 0(a 2 ). We express our results for dijet observables (A 2 ± and A 3 _), but they can be 
applied to the more general n-jet case of A n ± as well. The quark collinear function can be 
derived from the calculations in Sec. 4.1 and App. A. The gluon collinear function calculation 


- 39 - 















proceeds in analogy with the quark case, and there is a strong relation between the quark 
and gluon results for nearly all coefficients. 

To 0(ag), the quark and gluon collinear functions contribute to A 2 ± and A 3 _, with 

(C.l) 
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C'«(A 2+ ) = 5(A 2+ ) + (^) 2 /C« (A 2+ ), 
a(A 3 -) = (^) 2 /C«(A 3 -), 

where i = q,g for the different parton types, and the subscripts on indicate the contri¬ 
bution of an individual jet region to lVj e t- The ©(a 2 ) terms are given by 
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and 
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The angular integral factor Xq is defined in Eq. (4.29) and x^’ b,c ’ d ^ are given in Eqs. (A.4) 
and (A.7). 
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